Theoretical Analysis of Density Ratio Estimation

نویسندگان

  • Takafumi Kanamori
  • Taiji Suzuki
  • Masashi Sugiyama
چکیده

Density ratio estimation has gathered a great deal of attention recently since it can be used for various data processing tasks. In this paper, we consider three methods of density ratio estimation: (A) the numerator and denominator densities are separately estimated and then the ratio of the estimated densities is computed, (B) a logistic regression classifier discriminating denominator samples from numerator samples is learned and then the ratio of the posterior probabilities is computed, and (C) the density ratio function is directly modeled and learned by minimizing the empirical Kullback-Leibler divergence. We first prove that when the numerator and denominator densities are known to be members of the exponential family, (A) is better than (B) and (B) is better than (C). Then we show that once the model assumption is violated, (C) is better than (A) and (B). Thus in practical situations where no exact model is available, (C) would be the most promising approach to density ratio estimation.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Efficient Direct Density Ratio Estimation for Non-stationarity Adaptation and Outlier Detection

We address the problem of estimating the ratio of two probability density functions (a.k.a. the importance). The importance values can be used for various succeeding tasks such as non-stationarity adaptation or outlier detection. In this paper, we propose a new importance estimation method that has a closed-form solution; the leave-one-out cross-validation score can also be computed analyticall...

متن کامل

تعیین مدل تجربی برآورد بده اوج لحظه‌ای چند حوزهْ آبخیز غرب ایران

Peak discharge is one of the basic parameters in the design of hydraulic structures. There are various methods for peak discharge determination. Regional flood frequency analysis is based on physical, climatological and hydrological characteristics of basins. The objective of this study is to examine different models for the estimation of quantiles for some catchments in western Iran (namely: G...

متن کامل

تعیین مدل تجربی برآورد بده اوج لحظه‌ای چند حوزهْ آبخیز غرب ایران

Peak discharge is one of the basic parameters in the design of hydraulic structures. There are various methods for peak discharge determination. Regional flood frequency analysis is based on physical, climatological and hydrological characteristics of basins. The objective of this study is to examine different models for the estimation of quantiles for some catchments in western Iran (namely: G...

متن کامل

Density Ratio Estimation in Machine Learning

Machine learning is an interdisciplinary field of science and engineering that studies mathematical theories and practical applications of systems that learn. This book introduces theories, methods, and applications of density ratio estimation, which is a newly emerging paradigm in the machine learning community. Various machine learning problems such as non-stationarity adaptation, outlier det...

متن کامل

Estimation of Density using Plotless Density Estimator Criteria in Arasbaran Forest

    Sampling methods have a theoretical basis and should be operational in different forests; therefore selecting an appropriate sampling method is effective for accurate estimation of forest characteristics. The purpose of this study was to estimate the stand density (number per hectare) in Arasbaran forest using a variety of the plotless density estimators of the nearest neighbors sampling me...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:
  • IEICE Transactions

دوره 93-A  شماره 

صفحات  -

تاریخ انتشار 2010